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Abstract — Correlated scattering occurs naturally in frequency- 
selective fading channels and its impact on the performance 
needs to be understood. In particular, we answer the question 
whether the uncorrelated scattering model leads to an optimistic 
or pessimistic estimation of the actual average capacity. In the 
paper, we use majorization for functions to show that the average 
rate with perfectly informed receiver is largest for uncorrelated 
scattering if the transmitter is uninformed. If the transmitter 
knows the channel statistics, it can exploit this knowledge. We 
show that for small SNR, the behavior is opposite, uncorrelated 
scattering leads to a lower bound on the average capacity. Finally, 
we provide an example of the theoretical results for an attenuated 
Ornstein-Uhlenbeck process including illustrations. 

I. Introduction 

The ergodic capacity of a single-user multipath fading 
channel with slow fading is well known for the case when 
the receiver has perfect channel state information (CSI) [1], 
[2], [3]. The resource allocation for such channels - discrete 
and continuous time - is studied extensively. Bit and power 
loading as well as rate adaptation for single- [4], [5] and multi- 
user systems [6], [7] is performed under different quality-of- 
service (QoS) requirements and under different assumptions 
on the channel state information (CSI) at the transmitter. The 
single-user case with perfect CSI at the transmitter and receiver 
leads to spectral water filling power allocation [8]. Adaptation 
to long-term CSI is proposed in [9] under average and outage 
QoS constraints. The multi-antenna multi-carrier channel is 
analyzed from an information-theoretic perspective in [10]. 

Obviously, the average achievable rate depends not only 
on the CSI but also on the channel statistics. Often, an 
uncorrelated scattering channel is assumed. However, this 
assumption does mostly not apply to ultra-wideband (UWB) 
channels. Furthermore, correlation occurs if transceiver filters 
are taken into account, even for an uncorrelated scattering 
channel [11]. 

Recently, the achievable average rate for a single-user 
channel with correlated scattering was studied for the single 
antenna case in [12], [11], [13] and the multiple antenna case 
in [14]. The results indicate that the tap correlation decreases 
the performance if no CSI is available at the transmitter. If 
CSI is available the behavior depends on the signal-to-noise 
ratio (SNR) whether the performance is increased (low SNR) 
or decreased (high SNR) by tap correlation [11], [15]. 



The main contribution of this paper is the non-trivial ex- 
tension of the results from [15] to the continuous -time case. 
A different notion of majorization for function is used. The 
theory is illustrated by a concrete example using the attenu- 
ated Ornstein-Uhlenbeck process, i.e., exponentially decaying 
power and correlation. 

Note that majorization for functions is applied in communi- 
cation theory before in [16]. There the order is used to compare 
spectra of eigenvalues of Wishart matrices in the context of 
MIMO systems. In the current paper, we use the order to 
compare correlation scenarios of frequency selective channels. 

II. Channel model and capacity formulas 

We consider a single-user single-antenna frequency selective 
slowly fading channel, continuous in time and frequency 
domain. We assume an average power constraint P on the 
channel input and perfect CSI at the receiver. The noise at 
the receiver is additive white Gaussian with power spectral 
density N . The channel model employed below is an extended 
version of [10] and is described in [13] in detail. 

Let (X T ), (Y T ), t <E R, be real i.i.d. second order Gaussian 
processes with zero mean and continuous covariance function 
R with R(T,T r )dTdT' < oo. By the complex ran- 

dom process 



H = (H T ) = (X T + jY T ), tg 



(1) 



we model the continuous-time channel impulse response (CIR) 
of a blockfading multipath channel in lowpass-equivalent 
form with Rayleigh-distributed magnitudes. The mean en- 
ergy contained in (H T ) ist given by the constant c = 
2 R(t, t) dr, which we use later for normalization. Note 
that uncorrelated scattering can be modeled by choosing 
R(t,t') — g(i~)g(T')5(T — t'),t 7 t' £ R, with g satisfying 
J Too 5 2 ( r )^ T < 00 an d ^ being the Dirac delta distribution. 

The average rate within the frequency band (— W/2, W/2) 
is calculated in [nats/s] by [2], [1] 



C(H,p,p,W)=E 



W/2 
■W/2 



log (l + pp(f)\H f \ 2 )df 



where H = (H f ) = (j^ H r dr) , f 

Fourier transform of the process (H T ). 



(2) 



is the 



The function / ^ p{f), f e {-W/2, W/2), is the spectral 
power allocation function with p(f) > and with power con- 
straint $^wi 2 p(f)df — 1. In later calculations we reasonably 
assume p to be continuous. By p = N ^ w we denote the 
average SNR. 

The type of integral to calculate H and C(H, p,p,W) is 
a stochastic Riemann-integral. The assumptions made for the 
channel model, particularity for the covariance function R, 
ensure the existence of all involved quantities as shown in 
[13]. To evaluate |2} we need to calculate these integrals. 
Fortunately, we are allowed to exchange expectation operator 
and integration, which is a property of the stochastic Riemann- 
integral. The result is derived in [13] and can be rewritten in 
integral form as 



C(a,p,p,W) 



W/2 



W/2 



E z [log(l + pp(f)a(f)z)]df, (3) 



where z is an exponentially distributed random variable with 
expected value oneQand 



*(/) 



E 



/ R(t,t / )cos(2tt(t -T')f)dTdr' 

-OO J — OO 



(4) 



for / e (- W/2, W/2). We explicitly note that C(a,p,p,W) 
is finite for all valid parameters. Note that a is continu- 
ous and differentiable, which follows from the properties of 
R and from the First Fundamental Theorem of Calculus. 
Furthermore, a(f) > and f™^, 2 <*(f)df < oo due to 
Parseval's Theorem. Further note that u is constant in case of 
uncorrected scattering. Subsequent analysis requires to rewrite 
d3l as 



C(a,p,p,W) = W E z [\og(l + pa(f)p(f)z)]df (5) 
Jo 

with the scaled and shifted functions cr(/) = a(W(f — 1/2)), 
p(J) = p(W(f - 1/2)), / S (0,1), with power constraint 
Jo P(f)df = Subsequently, we want to refer to a or a as 
the spectral fading variance (function). 

III. Preliminaries 

We review necessary basic definitions from [17] and results 
from [18]. Let a; be a real measurable function on (0,1). The 
distribution function of x is given by 

d x (s) =n{{t'. x(t) >s}),seR, 

where p is the Lebesgue measure. The distribution function 
is nonincreasing and right-continuous. One says that two 
functions x,y are equivalent in distribution if d x — d y . The 
right-continuous inverse of d x is defined by 

x*(t) = inf{s : d x (s) < t}, t 6(0,1). 

It is nonincreasing and is called the decreasing rearrangement 
of x. The functions x, x* are equally integrable or (non- 
integrable) and their integrals are related by J Q x*(t)dt = 

1 In the rest of the paper we use the variable z in this meaning. 



Jg 1 x(t)dt and J Q S x*(t)dt > x(t)dt. We observe that non- 
negativity and continuity of x implies non-negativity and 
continuity of x*. 

Assuming x E L\ where L\ = Li(0,l) is the space of 
integrable functions on (0, 1), [19] showed that the original 
function x can be recovered from x* with x(t) — x*(<fi(t)) by 

(j)(s) = p{t : x(t) > x(s)} + p{t< s : x(t) = x(s)}. (6) 

Next, we define the partial order on functions x,y € L\. 
The following definitions can be found in [17, Def. 1.2]. 
Majorization for functions is also discussed in [20, Sec. 2.1.3]. 



Definition 1 Let x,y G L\. We say that x majorizes y and 
write x y y if 



x*{t)dt> / y*(t)dt 
it Jo 



for all s with < s < 1 and 



x*(t)dt= [ y*(t)dt. 



(7) 



(8) 



Remark: One may conclude there always exists an e > such 
that for continuous x,y we have x*(t) > y*(t) for t £ (0,e). 
This is not true in general. However, if we additionally require 

x*(0+) >y*(0+), 

where x*(0+) = lim t |o x*(t), then there exists an e > 0, such 
that x*(t) > y*(t) for t e (0, e). This additional requirement 
we will use in Theorem IIV.2I 

The order-preserving functionals with respect to majoriza- 
tion are called Schur-convex and Schur-concave. 

Definition 2 A real-valued functional (f> defined on A C L\ 
is said to be Schur-convex on A if for all x,y S A with x >z y 
it follows that <p(x) > 4>(y). 

If —(f) is Schur-convex on A then (j) is Schur-concave on A 

The following result is used later. It can be found in [18] and 
was first proved in [21]. The corresponding result on vectors 
is given in [22, Prop. 3.C.1]. 

Lemma III.l Let x,y € L\ with x y y. If g is a real- 
valued, concave function on an interval I C M such that 
g o x, g o y s L\, then 



(gox)(t)dt< f (goy)(t)dt. 



(9) 



In order to compare different correlation scenarios, we can use 
majorization for functions. 

Definition 3 We say that a channel with correlation func- 
tion R\ and corresponding spectral fading variance <j\ is 
more correlated than a channel with correlation function R2 
and corresponding spectral fading variance 02 if o~\ >z 02. 



IV. Average capacity characterizations 

In this section, we present our main results. Two scenarios 
are studied, namely the case in which the transmitter has no 
CSI and the case in which it knows the spectral fading variance 
function a. 

A. No CSI at transmitter 

If the transmitter has no CSI, the most reasonable strategy 
is to apply equal power allocation, i.e. p(f) = 4^, / € (0, 1). 
The average rate is then given by 



C no (a,p, W)=W 



log 1 



W 



df. (10) 



Theorem IV.l The functional C no given in ( |70p with no CSI 
at the transmitter is Schur-concave with respect to the spectral 
fading variance a for all ^ > Q, i.e., a\ > a 2 implies 

Cno(ci) < C no (a 2 ). 



Proof: We apply Lemma III. 1 1 to prove the theorem. 
For all a > 0, the function s 1— ► log (1 + as) is concave 
on M. + and thus s 1— > log (l + ^sz(w)) is concave on R + 
for each realization z(w) of z. Since the Lebesgue-integral is 
monotonic (and thus the expectation operator), 

P 



log 1 



W 



is concave on M + . Since C no (cr, p, W) = W L ((f>oa)(f)df is 
finite under the assumptions made, as noted in Section [III] and 
proved in [13], we can apply the Lemma [IV. 1 | which completes 
the proof. ■ 
Remark: Assume two channels with spectral fading variance 
o\ and a 2 and further assume <7i(/) > 0, 02(f) > 0, / G 
(0, 1). For high average SNR, i.e., high values of p, the average 
rate C DO (a, p, W) can be approximated by committing the 1 
in the logarithm in ( [T0| to obtain 



C no (a,p, W) = W 



E 2 



log (^a(f)z)]df 



(11) 



We use this approximation to calculate the difference of the 
average rates for fading variances o\ and a 2 for high SNR 
and obtain 

AC no (ai,a 2 ) = C no (ai,p,W) - C no (a 2 ,p,W) 

The equation is the analogue to equation (19) in [15]. 

B. Partial CSI at the transmitter 

If the transmitter knows the spectral fading variance, it 
can adapt the power allocation function p accordingly. The 
derivation of the optimal power allocation is similar to [23] 
and we obtain 

C part (a, p, W) = max W J E z [log (1 + pa(f)p(f)z)} df 
p 

s.t P (f)>o, f 1 P (f)df<^, /e(o,i). (13) 



We easily verify that the constraint set is convex. Furthermore, 
the functional p i-> W E z [log (1 + pa(f)p(f)z)]df is 
strictly concave on the constraint set, which is identically 



shown as in the proof of Theorem IV. 1 



Thus the Karush-Kuhn-Tucker (KKT) conditions are suffi- 
cient for a unique global maximum [24]. Evaluation of the 
KKT conditions gives the following characterization of the 
optimal solution p a : 



Po-(f)z 



1 + P°(f)Po(f)z 

^>0, A(/)>0, p o (/)A(/)=0, / 6(0,1), 



(14) 



where v and / 1— > A(/), / G (0, 1), are Lagrangian multipliers. 
Without loss of generality we can assume v > 0. 

The average capacity with a known at the transmitter is 
then given by 

C part (a, p,W) = W [ E z [log (1 + pa(f) Po (f)z)] df. (15) 
Jo 

Remark: We can equivalently use the decreasing rearrange- 
ment a* of a to calculate the capacity. We simply replace 
a by a* and p by p* in (13i, (14i, and ( 15 1. Then, the 



optimal power allocation p a belonging to a can be obtained 
from the optimal power allocation p* belonging to a* by 
Po(f) = Pi (4>(f)), f G (0, 1), with 4> as in (§>. 
In the following we consider the quantity 

0(a, p,W) = p ({/ : Po (f) >0}) = P ({/ : P* (f) > 0}) , (16) 

i.e., the Lebesgue-measure of the support of p a and call it the 
volume of active frequencies. 

Lemma IV.l For given fading variance a and bandwidth W, 
the function p 1— > 6(a,p,W) is strictly monotonic increasing 
on [0, 00). In particular, for given e > there exists a unique 
/5(e) such that 9(a, p, W) < e for all p < /5(e). 

Proof: We define the function 



i>(x) = E 2 



1 + xz 
e^Ei^l/x) 



, x G [0,oo) (17) 



with Eii(y) = — Ei(— y), where Ei is the exponential integral 
[25, Ch. 5.1]. The function is strictly convex and strictly 
monotonic decreasing with ^(0) = 1 and lim^^oo ip(x) = 0. 
These properties hold for the function x 1— * x + zx , x G [0, 00), 
for all z > (verified with first and second derivative) and 
hold also for ip due to the monotonicity of the Lebesgue- 
integral. 

The properties of ip ensure the existence of the inverse 
function ip -1 , which is defined on (0, 1]. The inverse ip^ 1 is 
strictly convex, strictly monotonic decreasing and ^^(x) > 
for x G (0, 1). We extend tp^ 1 by defining ijj^ 1 (x) — for 
x G (1, 00). 

For a > consider now the function x 1— > C, a (x) — 
xip~ 1 (ax) for x G (0, —] and ( a (x) = for x G (i,co). It is 



not difficult to show that ( a is strictly monotonic decreasing 
on (0, i] with Ca(0+) = 1 and - 0. 

From the optimality condition in ( p"4} it follows 



Po(/) = C, 



1 



(18) 



for all / G (0, 1) for which p*(/) > 0. 

Since er* is monotonically decreasing on (0, 1) it follows 



that / ^ ( v 



is monotonically decreasing on (0, 1) 



for constant ^ > and p > 0. 

If £„ pa-* (i) ) > we define /* = 1 and otherwise we 
define /* to be the smallest / such that v = pa*(f). All 
frequencies f < f* are active and all other frequencies are not. 
Thus, we have 9(a, p, W) = /*. From the monotonicity of £„ 
it easily follows, that /* increases for increasing p if v and 
a* are fixed. However, since v is the parameter guaranteeing 
the condition J Q p*(f)df — ^ to hold, it is also a function of 
a* and p. Thus, to show that /* increases with increasing p, 
we finally have to show that v ^ a decreases with increasing 

p. m 

In case of known a at the transmitter, we obtain the 
following theorem characterizing the impact of correlation for 
low SNR. 

Theorem IV.2 Let g\ and a 2 be fading variances with o\ y 
c 2 and (T*(0+) > cr^O-h). Then for given bandwidth W there 
exists a p > such that 

Cpart ("I) Pi W) > C palt (<T2,p,W) 

for all p < p. 

Proof: Since o\ and a 2 are continuous and o"i(0+) > 
cr 2(0+), there is an e > such that cr*(/) > cr^if) 
for / G (0, e). We take the largest possible e with this 



property. According to Lemma IV. 1 there exists pi{e) such that 



V. Example and illustrations 

A. Definition of channel model 

As in [13], we consider an exponentially attenuated 
Ornstein-Uhlenbeck process to illustrate the results. The chan- 
nel model in time-domain, given by ([TJ, is then described by 
the covariance function for the real and imaginary part 

R(t,t') = ce- a \ T - T \e- b ^+ T 'h {rm (T)l {T , m (T'), (20) 

for r, t' G R, where c > is the normalization constant 
introduced in Section [TTJ The covariance function captures an 
exponential power decay controlled by parameter b. In addi- 
tion, the correlation decays exponentially with delay separation 
controlled by parameter a. 

B. Fading variance specification 

The spectral correlation function given by Q can be com- 
puted in closed form [13] 

2cd 

&d{f) = d 2 + {2lTf) 2 > f e (-W2, W/2), (21) 

with the parameter d = a + b. Note that the power and 
correlation decay parameters have the same impact on the 
fading variance, since they occur in (f2T) only as sum. We 
choose the parameter c such that j_yf/2&{f)df = 1> i- e -> 
c = , w s . Further, we require the shifted and scaled 

version of a d , given by a d (f) = a d (W(f - 1/2)), / G (0, 1). 
Due to the symmetry of a d we easily obtain the decreasing 
rearrangement 

arctan [^f-) d 2 + (nWf) 
For the considered example, we obtain the following result, 



(Hi that for large SNR, the optimal power allocation is equal 
power allocation and the behavior with known fading variance 
at the transmitter is identical to the no CSI scenario. 



which allows us to apply Theorem IV. 1 and IV.2 



6(<Ti,p,W) < e for p < pi(e), i = 1,2. Since cr|(/) > a* 2 {f) 
for / e (0, e), we have pi(e) > /02(e). Choosing p G (0, /02(e)) 
we can rewrite ( fl3] l as 

Cp^ (<r, P, = maxW /„' E z [log (1 + pa* (f)p* (f)z)} df 

s.t.p*(/)>0, / V(/)#<i / 6(0,1). 

Again using cr*(/) > cr|(/) on (0, e) yields 

E[log(l + pat(f)p*(f)z)} >E[log(l+pa*2(f)p*(f)z)} (19) 

for all / £ (0,e), p G (0, /02(e)), and valid p*. Applying 
J^(-)df and max p (-) on both sides of ( |l9| > does not change 
the order of the inequality which completes the proof. ■ 
We remark that for small SNR, the effect of correlation with 
known fading variance is opposite compared to the no CSI 
scenario. It can be shown using the same approximation as in 



Lemma V.l Let a dl and a d2 be the fading variances for 
parameters d± , c?2 > 0. If d\ < d2 then a dl ^ cr d2 . Further, 
we have 0^(0+) > ^ 2 (0+) and a* dl {f) = o^(/) for 



f 



where A x — arctan 




dnA n 



diA a 



diA a 



(23) 



Proof: We give here only the sketch of the proof since 
the derivations are tedious and due to lack of space. First, 
we compute <j,(0+) — — — * . w , and then show that the 

d 3,rct£Lii( — ^ — J 

function d 1— > a' d (0+),d <E (0,oo), is strictly monotonic 
decreasing using its first derivative. This implies a di (0+) > 
a d (0+). Equation (23 1 is directly obtained by calculating the 
solution of the equation ct^ (/) = cr d2 (/)■ 

To prove that a dl y a d2 we show that for all s G [0, 1) the 
function £ s with 

arctan (^p) 



Ucl) = / o* d (f)df 
Jo 



W arctan (^) 



, de (0,00), (24) 



is monotonically decreasing using its first derivative and that 

£i(d) = 4. " ' ~ ■ 



C. Illustrations 

We use W = 1 in all subsequent simulations. Fig. [T] 
shows the decreasing rearrangements of three spectral fading 



variances illustrating the results of Lemma V.l We observe 




Fig. 1. Decreasing rearrangement of spectral fading variances. 

that with increasing d the rearranged fading variance is more 
spread out, which fits well with the notion of majorization. 




Fig. 2. Average capacity with no CSI at transmitter. 

Fig. [2] shows that the average rate C no with no CSI at the 
transmitter increases with increasing d for all values of p, 
which follows from Lemma IVTI and Theorem II V. 1 1 




Fig. 3. Average capacity with known fading variance at transmitter. 



Finally, Fig. [3] illustrates the result from Theorem IV.2 and 
Lemma |I V. 1 1 The high SNR behaviour is identical to the case 
with no CSI, for low SNR it is reversed. 
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